    • A pipeline architecture for feature-based unsupervised clustering using multivariate time series from HPC jobs

      Enes, Jonatan; Expósito, Roberto R.; Fuentes Rodríguez, Jose; López Cacheiro, Javier; Touriño, Juan (Elsevier B.V., 2023-05)
      [Abstract]: Time series are key across industrial and research areas for their ability to model behaviour across time, making them ideal for a wide range of use cases such as event monitoring, trend prediction or anomaly ...
    • Accelerating binary biclustering on platforms with CUDA-enabled GPUs 

      González-Domínguez, Jorge; Expósito, Roberto R. (Elsevier Ltd, 2018)
      [Abstract]: Data mining is nowadays essential in many scientific fields to extract valuable information from large input datasets and transform it into an understandable structure. For instance, biclustering techniques are ...
    • Accelerating the quality control of genetic sequences through stream processing 

      Castellanos Rodríguez, Óscar; Expósito, Roberto R.; Touriño, Juan (Association for Computing Machinery, 2023)
      [Abstract]: Quality control of DNA sequences is an important data preprocessing step in many genomic analyses. However, all existing parallel tools for this purpose are based on a batch processing model, needing to have ...
    • Analysis and evaluation of MapReduce solutions on an HPC cluster 

      Veiga, Jorge; Expósito, Roberto R.; Taboada, Guillermo L.; Touriño, Juan (Pergamon Press, 2016-02)
      [Abstract] The ever growing needs of Big Data applications are demanding challenging capabilities which cannot be handled easily by traditional systems, and thus more and more organizations are adopting High Performance ...
    • Analysis of I/O Performance on an Amazon EC2 Cluster Compute and High I/O Platform 

      Expósito, Roberto R.; Taboada, Guillermo L.; Ramos Garea, Sabela; González-Domínguez, Jorge; Touriño, Juan; Doallo, Ramón (Springer Netherlands, 2013-12)
      [Abstract] Cloud computing is currently being explored by the scientific community to assess its suitability for High Performance Computing (HPC) environments. In this novel paradigm, compute and storage resources, as well ...
    • BDEv 3.0: energy efficiency and microarchitectural characterization of Big Data processing frameworks 

      Veiga, Jorge; Enes, Jonatan; Expósito, Roberto R.; Touriño, Juan (Elsevier BV * North-Holland, 2018-09)
      [Abstract] As the size of Big Data workloads keeps increasing, the evaluation of distributed frameworks becomes a crucial task in order to identify potential performance bottlenecks that may delay the processing of large ...
    • BDWatchdog: real-time monitoring and profiling of Big Data applications and frameworks 

      Enes, Jonatan; Expósito, Roberto R.; Touriño, Juan (Elsevier BV * North-Holland, 2018-10)
      [Abstract] Current Big Data applications are characterized by a heavy use of system resources (e.g., CPU, disk) generally distributed across a cluster. To effectively improve their performance there is a critical need for ...
    • BDWatchdogFaaS: A Tool for Monitoring and Analysis of Functions-as-a-Service in Cloud Environment 

      Framil de Amorín, Manuel; Carpente, Mario; Fraga, David; Enes, Jonatan; Expósito, Roberto R.; Taboada, Guillermo L.; Touriño, Juan (Universidade da Coruña, Servizo de Publicacións, 2023)
      BDWatchdog is a framework to assist in the in-depth and real-time analysis of the execution of Big Data frameworks and applications. BDWatchdog was originally developed to monitor Hadoop ecosystems deployed on serverless ...
    • Big Data-Oriented PaaS Architecture with Disk-as-a-Resource Capability and Container-Based Virtualization 

      López Cacheiro, Javier; Expósito, Roberto R.; Touriño, Juan; Enes, Jonatan (Springer Netherlands, 2018-12)
      [Abstract] With the increasing adoption of Big Data technologies as basic tools for the ongoing Digital Transformation, there is a high demand for data-intensive applications. In order to efficiently execute such applications, ...
    • BigDEC: A multi-algorithm Big Data tool based on the k-mer spectrum method for scalable short-read error correction 

      Expósito, Roberto R.; González-Domínguez, Jorge (Elsevier, 2024-05)
      [Abstract]: Despite the significant improvements in both throughput and cost provided by modern Next-Generation Sequencing (NGS) platforms, sequencing errors in NGS datasets can still degrade the quality of downstream ...
    • CUDA-JMI: Acceleration of feature selection on heterogeneous systems 

      González-Domínguez, Jorge; Expósito, Roberto R.; Bolón-Canedo, Verónica (Elsevier, 2020-01)
      [Abstract]: Feature selection is a crucial step nowadays in machine learning and data analytics to remove irrelevant and redundant characteristics and thus to provide fast and reliable analyses. Many research works have ...
    • Design and Evaluation of Low-Latency Communication Middleware on High Performance Computing Systems 

      Expósito, Roberto R. (2014)
      [Abstract] Interest in Java for parallel computing is motivated by its appealing features, such as multithreading support, portability, ease of learning, high productivity, and the significant increase ...
    • Design of Scalable Java Communication Middleware for Multi-Core Systems 

      Ramos Garea, Sabela; Taboada, Guillermo L.; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón (Oxford University Press, 2013-02-01)
      [Abstract] This paper presents smdev, a shared memory communication middleware for multi-core systems. smdev provides a simple and powerful messaging application program interface that is able to exploit the underlying ...
    • Design of scalable Java message-passing communications over InfiniBand 

      Expósito, Roberto R.; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón (Springer New York LLC, 2012-07)
      [Abstract] This paper presents ibvdev, a scalable and efficient low-level Java message-passing communication device over InfiniBand. The continuous increase in the number of cores per processor underscores the need for ...
    • Enabling Hardware Affinity in JVM-Based Applications: A Case Study for Big Data 

      Expósito, Roberto R.; Veiga, Jorge; Touriño, Juan (Springer, 2020)
      [Abstract]: Java has been the backbone of Big Data processing for more than a decade due to its interesting features such as object orientation, cross-platform portability and good programming productivity. In fact, most ...
    • Enhancing in-memory Efficiency for MapReduce-based Data Processing 

      Veiga Fachal, Jorge; Expósito, Roberto R.; Taboada, Guillermo L.; Touriño, Juan (Academic Press, 2018-10)
      [Abstract] As the memory capacity of computational systems increases, the in-memory data management of Big Data processing frameworks becomes more crucial for performance. This paper analyzes and improves the memory ...
    • Evaluation of Java for General Purpose GPU Computing 

      Docampo, Jorge; Ramos Garea, Sabela; Taboada, Guillermo L.; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón (IEEE Computer Society, 2013-07-01)
      [Abstract] The presence of many-core units as accelerators has been increasing due to their ability to improve the performance of highly parallel workloads. General Purpose GPU (GPGPU) computing has allowed the graphical ...
    • Evaluation of messaging middleware for high-performance cloud computing 

      Expósito, Roberto R.; Taboada, Guillermo L.; Ramos Garea, Sabela; Touriño, Juan; Doallo, Ramón (Springer U K, 2013-12)
      [Abstract] Cloud computing is posing several challenges, such as security, fault tolerance, access interface singularity, and network constraints, both in terms of latency and bandwidth. In this scenario, the performance ...
    • FastMPJ: a scalable and efficient Java message-passing library 

      Expósito, Roberto R.; Ramos Garea, Sabela; Taboada, Guillermo L.; Touriño, Juan; Doallo, Ramón (Springer New York LLC, 2014)
      [Abstract] The performance and scalability of communications are key for high performance computing (HPC) applications in the current multi-core era. Despite the significant benefits (e.g., productivity, portability, ...
    • Flame-MR: An event-driven architecture for MapReduce applications 

      Veiga, Jorge; Expósito, Roberto R.; Taboada, Guillermo L.; Touriño, Juan (Elsevier BV * North-Holland, 2016)
      [Abstract] Nowadays, many organizations analyze their data with the MapReduce paradigm, most of them using the popular Apache Hadoop framework. As the data size managed by MapReduce applications is steadily increasing, the ...